Speech-Music Classification Model Based on Improved Neural Network and Beat Spectrum
نویسندگان
چکیده
A speech-music classification method according to a developed neural system and beat spectrum is proposed achieve accurate of through pre-emphasis, endpoint detection, framing, windowing other steps preprocess collect vocal music signals. After fast Fourier transforms triangle filter processing, the Mel frequency cepstrum coefficient (MFCC) obtained, discrete cosine transform performed obtain signal MFCC characteristic parameters. calculating similarity feature parameters similarity, matrix based on which obtained. The residual structure optimized by adding Swish max-out activation functions, respectively, between convolutional network layers build convolution deepen number layers. connected time series (CTC) used as objective loss function. It applied softmax layer deep optimization for model. pitch input information model realize classification. experiment proves that accuracy design higher than 99%; when iteration reaches 1200, training approaches 0; signal-to-noise ratio 180dB, sensitivity specificity are 99.98% 99.96%, respectively; voice 99%, running 0.48 seconds. has been proven high accuracy, low loss, good special effects, can effectively speech-music.
منابع مشابه
Music content authentication based on beat segmentation and fuzzy classification
Digital audio has been ubiquitous over the past decade. Since it can be easily modified by editing tools, there has been a strong need to protect its content for secure multimedia applications. Previous audio authentication algorithms are mainly focused on either human speech or general audio with music as part of the test data, while special research on music authentication has been somewhat n...
متن کاملElectrocardiogram Beat Classification using Probabilistic Neural Network
The Electrocardiogram (ECG) plays significant role in assessing patients with abnormal activity in their heart. ECG recordings of the patient taken to analyze abnormality and classify type of disorder present in the heart functionality. An Electrocardiogram is a bioelectrical signal that records the heart’s electrical activity versus time. It is used to measure the rate and regularity of heartb...
متن کاملService Classification Based on Improved BP Neural Network
With the development of the Internet, several candidate services have emerged for achieving the same task, most of which are functionally identical but different in non-functional properties. Therefore, these services can be classified into different service-quality levels. The so-called Quality of Service (QoS) comprises a set of non-functional properties that can be used to efficiently classi...
متن کاملAn Improved Fuzzy Neural Network for Solving Uncertainty in Pattern Classification and Identification
Dealing with uncertainty is one of the most critical problems in complicatedpattern recognition subjects. In this paper, we modify the structure of a useful UnsupervisedFuzzy Neural Network (UFNN) of Kwan and Cai, and compose a new FNN with 6 types offuzzy neurons and its associated self organizing supervised learning algorithm. Thisimproved five-layer feed forward Supervised Fuzzy Neural Netwo...
متن کاملmortality forecasting based on lee-carter model
over the past decades a number of approaches have been applied for forecasting mortality. in 1992, a new method for long-run forecast of the level and age pattern of mortality was published by lee and carter. this method was welcomed by many authors so it was extended through a wider class of generalized, parametric and nonlinear model. this model represents one of the most influential recent d...
15 صفحه اولذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Advanced Computer Science and Applications
سال: 2023
ISSN: ['2158-107X', '2156-5570']
DOI: https://doi.org/10.14569/ijacsa.2023.0140706